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ON  THE  EQUIVALENCE  OF  INFORMATION  STRUCTURES 
IN  STATIC  AND  DYNAMIC  TEAMS 


Y.  C.  Ho 
Pierce  Hall 
Harvard  University 
Cambridge,  Massachusetts 


K.  C.  Chu 

Systems  Control,  Inc. 
260  Sheridan  Avenue 
Palo  Alto,  California 


In  team  theory,  we  visualize  N  decision  makers  (DM)  or  team 
members  each  receiving  individual  information.  There  are  two  important 
classes  of  information  structures:  (i)  each  DM's  information  only  depends 
on  certain  primitive  random  variables  £  of  the  external  world  (ii)  each 
DM's  information  depends  not  only  on  £  but  also  the  actions  taken  by 
other  DMs.  The  first  class  can  be  called  STATIC  team  theory  or  static 
information  structure  and  the  second  DYNAMIC  in  the  sense  that  the  time 
order  in  which  the  DMs  act  is  irrelevant  in  the  first  case  while  causality 
plays  a  definite  role  in  the  second.  DM  who  acts  later  obviously  cannot 
affect  the  information  received  by  DM  who  acts  precedent  or  concurrent 
with  him.  Radner  [1]  has  established  certain  necessary,  and  sufficient 
conditions  for  optimality  of  the  decisions  of  the  DMs  for  a  wide  class  of 
static  teams.  Ho  and  Chu  [2]  solved  the  linear-quadratic-gaussian 
dynamic  team  problems  by  showing  their  equivalence  to  the  static  case 
under  certain  further  assumptions  on  the  information  structure  of  the 
team.  In  this  note  we  generalize  the  conditions  of  equivalence  to  non¬ 
linear  and  nongaussian  cases. 


i 


-2- 


Let  |  be  a  random  vector  defined  on  a  basic  probability  apace 
(ft,  &).  u  =  [uj  ,  u^ ,  . .  .  ,  u^]  be  the  decision  (control)  vector  of  the 
N  DMs.  Let  z ^  be  the  information  available  to  the  ith  DM, 

zi  =  etc.  ),  =  (1) 

where  rt.  are  assumed  to  be  measurable  functions  of  f  and  u,  etc.  In 
i  -j 

general  this  information  is  affected  by  the  decision  variables  of  some  of 

the  DMs  that  act  before  i.  Let  j  be  one  of  those  DMs  which  do  so,  we  say 

DM  j  is  a  precedent  of  DM  i  and  denote  it  by  j  4  i.  In  (1)  u.  represents 

J 

the  decisions  of  all  those  j  such  that  j  i  i,  For  causal  system,  there  is 
always  some  starting  DMs  without  any  precedents  for  them  and  their 
information  depends  only  on  £  .  The  precedence  relation  is  assumed  to  be 
fixed  in  advance  and  does  not  depend  on  random  events  and  decision  out¬ 
comes.  This  precedence  relation  is  generally  a  partial  ordering  [2],  We 
can  then  partition  the  N  DMs  into  the  following  disjoint  Bets: 

k 

c4'=  {1,2,...,N}  =  U 


where 


•A'^  -  {i|z^  =  the  set  of  starting  DMs, 

Az  =  {i|z.  =  rji(? ,  Uj)  ,  j  e  A]} 

k-1  k-1 

^k  =  =  n«(€,u.)  ,  j  e  U  ji'  ,  i  £  U  c/fc} 

K  1  1  i  i=l  1  f=l  £ 


The  decision  rule  (control  law)  of  DM  i  is  assumed  to  be  a  function  of  its 
own  information 


ui  =  Vzi> 


i  =  1,.  .  .N 


(2) 


where  y.  belongs  to  the  set  P  of  all  admissible  (Borel)  measurable 
functions  from  the  space  of  z.  to  the  space  of  u^.  Consequently  for  fixed 
V  =  ["Yi*  •  •  •  ,  y^])  zi  Educes  subfields  C  3T.  The  above  set  up  is  in 
complete  harmony  with  the  usual  dynamic  stochastic  system  statement  of 
a  team  problem.  For  example,  we  have, 


xt+l  =  £(xt’Ut’Wt)  t  =  1, .  .  .  ,  N 
yt  =  h(xt,  vt) 


(3) 

(4) 


where  x^w^v^  are  external  random  variables  playing  the  part  of  g  and 
{y  [t  e  subset  of  { 1,  .  .  .  ,  t)  }  =  z  are  the  information  available  to  u  . 

Tv  V 

The  point  is  that  the  intermediate  variables  x^  are  irrelevant  in  a 
decentralized  setting  and  can  always  be  eliminated  via  substitution.  Only 
u  and  5  and  the  information  structure  as  specified  by  (1)  count. 

Definition  1.  Information  z.  is  said  to  be  nested  in  z.  if  there  exists  a 

-  j  -  i 

measurable  function  f.j  from  the  space  of  z.  to  the  space  of  z ^  such  that 

zj  =  fij*V  V  ^  and  u  (5> 


Equivalently,  we  say  z.  is  nested  in  z^  if 

,'^c^  ¥  y  e  r=  Tjx^x.  .  .  xTn  (6) 

Definition  2.  An  information  structure  p  =  [p  .  .  .  ,  p  ]  is  said  to  be 
partially  nested  if  for  all  j,  i  j  1  i  implies  z.  is  nested  in  z... 

By  definition,  in  static  team  there  are  no  precedents  for  any  of  the 
decision  makers;  hence,  any  static  team  is  trivially  partially-nested. 

A  particularly  simple  and  common  form  of  partially  nested 


information  is 


wmijS 
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{Zj|j  **0 


z-  =  n!(C,Uj) 


In  other  words  if  u.  affects  the  information  received  by  the  ith  DM,  then 
the  ith  DM  will  also  know  what  the  jth  DM  knows  (as  part  of  z.).  In  this 
case,  h j  is  simply  a  row  matrix  of  {1, 0}  which  picks  out  certain 
components  of  z^. 

Definition  3.  Information  z.  is  said  to  be  equivalent  to  information  z.  if  z. 

and  Zj  are  mutually  nested,  i.  e.  f^  in  (5)  is  invertible.  The  eqvuvalent  z^ 

and  z.  referred  here  may  represent  the  information  of  two  different  DMs 

or  two  different  forms  of  information  available  to  one  DM.  In  the  latter 

case,  any  strategy  in  =  -y^(z^)  that  can  be  realized  by  z.  can  always  be 

duplicated  by  z.  and  vice  versa.  Yet  the  computation  of  strategies  based 
J 

on  one  information  structure  may  be  simpler  than  the  other  particularly  if 
the  latter  is  of  static  nature. 

In  the  theorem  below  we  shall  state  a  condition  on  the  equivalence 
between  dynamic  and  static  information  structure.  To  this  ends,  suppose 
we  can  write 


zi  =  r,.(S,u.)  (8) 

=  rjiirjjd  ),  Uj)  =  K-lfyd);  Uj)  for  fixed  m 

Theorem  1,  Given  a  team  with  partially  nested  information  structure  t], 
if  furthermore  (8)  is  true  and  is  invertible  for  all  i  and  m,  then  the 
information  structure  z^  =  $.(£)  is  equivalent  to  z^  =  »].(|,u.)  for  i  -  1,  2, 


•  •  > 


Proof:  It  is  clear  that  we  can  choose 


zi  =  =  i  (Nj  (i.  e.  =  I,  the  identity  function). 


Next  we  write  from  (8) 

ai  =  n/n/5),  Uj)  =  j  e  Nr 

i  c  N, 


Since  z.  =  z.  is  partially  nested,  is  a  known  quantity.  Hence/ we  can 

J  j  J  J 


write 


i  =  rjjte)  =  K."1^.;^.))  , 


i  e  t/f"  j  e  -A. 


and 


z.  =  K. (z, :  \.(z.))  , 
i  i* -xy  j"  > 


This  process  can  obviously  be  iterated  for  i  c  <sty  .  .  .  ,  The 

collected  equivalent  information  structure  *7  =  [ >7 ^ ,  .  .  .  ,  r)^]  is  a  static  one. 
Examples.  In  all  examples  below  we  assume  (7)  and  illustrate  particular 
useful  cases  of  (8). 


(i)  z. 


(*j|j  -i  i) 


z!  =  H.5+  E  D..u. 
.  1  1  .7,  11  1 


4=4 


Ujlni} 


A,  A 
z:  =  z.1  - 
L  1 


i  -  E  D..y.(z.)  =  H.5 
1  -^i  iJTr  J  1  - 


The  above  transformation  can  be  used  recursively  to  produce  a  set  of  z. 
which  are  linear  combinations  of  £  only.  This  trick  was  used  to  prove  the 
optimality  of  a  wide  variety  of  LQG  problems  with  partially  nested 
information  structure  [2], 
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:  / 


(ii) 


z.  = 
1 


{  I J  <  i} 


=  h.(€)  +  d.(u.)J 


Lzi  =  H  -  di^j<zj»  =  V?)J 


Example  (ii)  shows  that  one  of  the  crucial  requirements  for  equivalence  is 
that  the  effect  of  u  be  additive. 


(iii) 


z. 

l 


{  zj  1  j  i  1} 


[_z{  =  A.(u.)h.(S)  +  b.(u.) 


A^(Uj)  is  an  invertible  matrix  for  all  admissible  u,  then  again 


z. 


i  <=*> 


(  *  j  I  j  <  i) 


-1 


IH-  Ai  (Vzj))[zi  ■  Wj))]  =  hi(e)J 


The  example  shows  that  if  the  effect  of  the  previous  u  is  multiplicative 

then  invertibility  in  some  form  is  required.  For  example,  if  u.  turns  off 

J 

some  measurement  system  such  that  A.(m)  =  0  for  all  i  y  j  then  the  system 
may  still  be  partially  nested  but  not  equivalent  to  a  static  one. 

Examples  (i)  -  (iii)  can  be  combined  to  yield  the  following  assertions: 
Assertion  1.  The  dynamic  team  in  Figure  1  has  an  equivalent  static 


information  structure 


NLD  -  Nonlinear  Dynamic  System 


NL  -  Nonlinear  Transformation,  g 

ALD  -  Additive  Linear  Dynamic  I  -  Invertible  Transformation,  k 

Systems,  Eq.  {#) 


*t+l  =  §(ut>  +  A*t  +  Wt 


(*) 


zt  =  kt<  5V  Jt  * 


^t  +  V  .• 


|t  <  t} 


KpWj.,  vj.  are  externally  specified  random  variables. 


zt  ^ 


-  H  D  At"1g  (u  )  =  £; 
T  i=  1  11  ' 


{S'  !t  <  t} 


Figure  1 


N> 
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Assertion  2.  The  dynamic  team  in  Figure  2  has  an  equivalent  static 
information  structure  provided  (g^u^A)'*  exist  for  any  admissible  u^. 


w 


NLP 


NL 

■7K- 


MLP 


imic  Sy 


c 

vt 

NLD 

7 

^ - 

vt 

\ 

MLD  -  Multiplicative  Linear  Dynainic  System,  Eq.  (T) 
-  Multiplication  of  a  vector  by  a  scalar  ^ 

xt+l  r  gt(ut)AxtWt 


*  I 


J.I 

f) 


Vf 


- 


A  , 
vx  =  z! 
t  t  t 


L{ z;  It  <  *} 


v^,  wt  are  scalar  random  sequences  externally  specified  x^  is  a  random 


variable. 


zt  4=* 


r  jjj  (gi<ui)A)_i  zi  *■ 


L(=t!t  <  t} 


1 


Figure  2 


Finally,  we  have  the  easy  generalization  of  the  Radner  theorem. 


\ 
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Theorem  11.  Let  d  dynamic  team  have  partially  nested  information 
structure  p  which  is  also  equivalent  to,  a  static  one  rj .  Let  the 
performance  function  of  the  team  be 

•  J(\)  =  E{.s?[u,5]} 

where  §  is  a  primitive  random  vector  with  given  prior  probability 
distribution  p(£).  If  (a)  &  is  convex  and  differentiable  in  u^  for  all  i  a -id 
for  a.  e.  t;  ,  (b)  J(\)  is  locally  finite  and  (c)  Inf^J(y)  >  -  oo ,  then  there  is 
a  unique  local  optimal  decision  u^  -  (z^)  for  all  i.  Furthermore,  if 

J(v)  =  E 1 1  uTQu  +  UTS5  +  uTc)  (9)  ) 

where  Q,  S,  c  are  constants,  ?  is  gaussian  with'|finite  covariance,  r). 

*  A 

linear  in  §  for  all  i,  then  y.  is  linear  in  z.  and  possibly  nonlinear  in 
Also  we  have  the  extension 

Corollary.  W  p(£)  in  theorem  II  has  the  form  ^ 

p(S)  =  f[(?  -  m)TX" 1  (§  -  m)  ]  (10) 

,  .  \ 

then  the  optimal  decision  rule  to  (9)  is  again  linear  in  z^, 

Proof:  Density  functions  of  the  form  (10)  are  called  spherically 
(elliptically)  invariant,  of  which  gaussian  distribution  is  a  special  case. 
Vershik  [3]  and  Blake  and  Thomas  [4]  have  shown  that  all  mean- square 
estimation  problems  with  this  class  of  random  variables  have  linear 
solutions.  More  specifically,  the  Bayesian  conditional  mean  E[§  [z  =  H?  ] 
is  linear  in  z,  and  it  is  always  true  that 

>Je  r  - 

For  definition,  see  [1J. 

\ 

\ 


(11) 


E($[£)  =  m  +  XHT(HXHT)"1(£  -  Hm)  , 

when  5  is  spherically  invariant. 

There  is  no  loss  of  generality  by  letting  m  =  0.  If  we  assume  linear 
decision  rule 

u.  =  A  A  +  b.  (12) 


and  solve  for  the  optimal  and  b^,  i.  e.  ,  substitute  (11)  and  (12)  into 
Eq.  (19)  of  [2],  we  find,  that 

l  Q..A.(H.XH?)  =  -  S.XH?  Vi  (13) 

f  ij  j  j  i  xi  '  ' 


b 


T 


Tr,"l 

c  Q 


(14) 


Everything  remains  the  same  as  the  gaussian  case,  except  X  here  is  not 
necessarily  the  covariance  of  § . 


Conclusion 

The  matter  discussed  in  this  report  concerns  the  reduction  of 
information  structures  in  dynamic  team  decision  making  to  static  cases. 
All  the  DMs  are  assumed  to  be  rational.  Partially-nested  information 
structure  is  a  system  in  which  the  DM  knows  what  his  precedents  have 
known  and  by  this  he  deduces  what  they  have  done.  Based  on  this  concept, 
the  condition  on  information  equivalence  has  been  derived.  The  essence 
of  information  obtained  may  be  disguised  in  various  forms  in  many 
dynamic  team  problems.  Our  theorems  clarify  the  information  structure 
in  several  classes  of  problems. 
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ON  THE  RELATIVE  LEADERSHIP  PROPERTY 


OF  STACKELBERG  STRATEGIES 
By 

Tamer  Basar 

Division  of  Engineering  and  Applied  Physics 
Harvard  University  •  Cambridge,  Massachusetts 

ABSTRACT 


The  relative  leadership  property  of  Stackelberg  strategies  has  been 
investigated  via  a  scalar  nonzero-sum  two-person  differential  game 
problem.  It  is  shown  that,  depending  on  the  parameters  of  the  game, 
there  exist  three  different  types  of  solutions  for  this  class  of  games. 
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Introduction 

In  a  recent  paper  [Ref.  1]  Simaan  and  Cruz  have  obtained  the  open- 
loop  Stackelberg  solution  for  a  class  of  deterministic  nonzero- sum  two- 
person  games  under  the  leadership  of  one  of  the  players.  One  of  the 
properties  of  the  Stackelberg  strategies,  as  discussed  in  Ref.  1,  is  that 
if  one  of  the  players  acts  as  the  leader  in  the  game  then  both  players 
might  benefit  from  this  leadership  in  the  sense  that  both  of  them  might 
end  up  with  better  payoffs  than  the  ones  obtained  from  the  Nash  strategic  at 
This  property  of  the  Stackelberg  solution  brings  up  the  question  of  as  how 
to  decide  on  which  player  should  lead  the  game  and  which  player  should 
follow.  Further,  there  is  the  question  of  whether  it  is  always  profitable 
for  either  player  to  act  as  the  leader  rather  than  be  the  follower. 

In  this  note,  we  address  ourselves  to  these  questions  via  a  scalar 
nonzero- sum  differential  game  problem  which  is  related  to  Example  5  of 
Ref.  1.  We  show  that  there  are  situations  in  which  a  player  would  prefer 
to  be  the  follower  rather  than  be  the  leader  and  that  this  leads,  in  general, 
to  three  different  types  of  solutions  for  this  class  of  games.  The 
concept  introduced  and  the  conditions  derived  for  the  scalar  example 
can  readily  be  extended  to  encompass  a  more  general  class  of  nonzero- 
sum  game  problems. 


+It  is  important  to  note  that  we  are  restricting  ourselves  to  noncooperative 
solution  concepts.  Otherwise,  pareto-optimal  solution  should  be 
considered  in  making  the  comparison. 


v 
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A  Linear  Quadratic  Differential  Game 

Consider  the  following  generalized  form  of  Example  5  of  Ref,  1. 
The  dynamics  are  described  by  the  scalar  linear  differential  equation 


X  =  Uj  -  U2  ,  X{0)  =  Xq  , 


(1) 


where  and  are  controlled  by  player  1  and  player  2  respectively 
and  are  measurable  functions  of  t  e  (0,  1]  and  x^.  The  costs  to  players 
1  and  2  are  given  by  the  quadratic  payoff  functions  and  J2  respectively, 
where 

1 

Ji(ui  •  u2> =  i  ri  xf +  2 r  /  U 1  dt  »  (2a) 

P  0 

1 

J2(ul  *  u2>  =  i  c2  xf  +  2T"  /  u 2  dt  »  (2b) 

e  0 

0^  >  0  f  ce  >  0  ,  Cj  /  0  ,  c2  4  0  ,  and  x^  denotes  the  terminal  state  (i,  e. 
x(l)).  Note  that  this  formulation  becomes  identical  to  Example  5  of  Ref,  1 
when  Cj  =  1,  c2  =  -  1  . 

Now  denote  the  Stackelberg  control  of  the  i'th  player  when  the  j'th 

player  is  the  leader  by  u.  .  and  the  corresponding  Stackelberg  payoff 

isj 

J.(u,  ,  ,  u,  .)  by  J1!  .  Then  it  follows  from  Eqs.  (48)  -  (53)  of  Ref,  1 
i  lsj  *  2sj  7  l  ~ 

that  the  open -loop  Stackelberg  solution  with  player  2  as  the  leader  is  given 
by 


ls2 


c c  ( 1  +  c ,  c  ) 

Ip  Ip  „ 

,u  ,2  x0 

( l+c, c  )  +  c_c 

'  1  p'  2  e 


(3a) 
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c  c, 
e  2 


2s2 


(1+CjCp)  +c2ce 

under  the  conditions 


(3b) 


(1  +  CjCp)  >  0  (4a) 

(i  +  CjCp)2  +  c2ce  >  0  ,  (4b) 

and  the  corresponding  Stackelberg  payoffs  are 


J 


2 

1 


1 

2 


cl(UclcP)3  2 

t<1+ciV2  +  C2Ce]2  X° 


[(i+CjCp)  +  c2ce] 


(5a) 


(5b) 


Using  a  symmetry  property  of  the  original  differential  game,  the 
open-loop  Stackelberg  solution  with  player  1  as  the  leader  can  readily  be 
obtained  from  (3a)  -  (5b),  i.  e. 


lsl 


c.  c 

—i-B 


<1+c2ce)2  +  clcp 


c2ce(l  +  c2ce) 

^si  =  777  XT  xo 

(l+C-C  )  +  C.  c 

2  e'  1  p 


under  the  conditions 


(6a) 


(6b) 


(1  +  c2ce)  >  0 


(7a) 
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(1  +  cec2)2  +  clGp  >  0 

and  the  corresponding  Stackolberg  payoffs  are 


(7b) 


jl  -  I _ CJ _  x2 

1  2  [(l+c2c/  +  clCp]  0 

1  ^  c2(Uc2Ce^ _  2 

2  =  2  ~  v2  ,2  X0 

[<1+c2ce)  +  clcpJ 


(8a) 


(8b) 


Denoting  the  Nash  payoffs  of  players  1  and  2  by  J  ^ 
respectively,  it  is  certainly  true  that 


and 


J 


1 

1 


J2  < 


(9) 


that  is,  the  leader  will  always  do  better  (in  the  sense  of  achieving  a  lower 
payoff)  than  his  Nash  solution.  [The  inequalities  in  (9)  are  strict  because 
of  the  assumption  ^  0,  c^,  4  0.  ]  However,  relation  (9)  does  not 
necessarily  imply  that  the  best  each  player  can  do  (in  a  noncooperative 
sense)  is  to  be  the  leader  in  the  game.  In  fact,  such  a  statement  will  not 
always  be  true  as  will  be  shown  in  the  sequel. 

1  2* 

In  order  to  derive  the  conditions  under  which  <  Jj  and/or 
2  1 

>  we  will  first  have  to  require  relations  (4a),  (4b),  (7a)  and  (7b)  to 
1  2 

be  satisfied.  Then  implies  (after  some  straightforward  but 

extensive  manipulations)  either 


(i)  c2  >  0 


(10a) 
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or 


(ii) 


c2  <  °  , 


c2ce 


<  1  + 


2+c.  c 
1  P 

d+clCp)5 


(10b) 


That  is,  if  either  (10a)  or  (10b)  is  satisfied,  player  1  can  achieve  the 
lowest  possible  payoff  (in  a  noncooperative  sense  and  assuming  that 
player  2  acts  rationally)  by  being  the  leader  in  the  game.  [Note  that  if 
,  i  I  j,  then  player  i  achieves  the  same  payoff  by  being  either  the 
leader  or  the  follower.  In  such  a  paradoxial  situation  we  assume  that  he 
acts  as  a  leader.  ] 

Similarly,  the  conditions  under  which  player  2  would  rather  prefer 
2  1 

to  be  the  leader  (i.  e.  <  J2)  are  either 


(iii) 


cl>0 


or 


(iv)  c,  <  0  , 


c,  c 
1  P 


<  1  + 


2+c2ce 

(l+c2ce)‘ 


(11a) 


(lib) 


provided  that  relations  (4a),  (4b),  (7a)  and  (7b)  are  also  satisfied. 

To  summarize  these  results  and  to  indicate  their  immediate 
implications  in  a  compact  form,  denote  the  set  of  Cj  4  0,  c^  [  0,  cp  >  0, 
c  >  0  which  satisfy  (4a),  (4b),  (7a)  and  (7b)  by  £2.  Further,  denote  the 
quadruple  {  Cj,  c2,  c  ,  ce  }  by  a.  Let  be  the  set  of  a  eft  which 
satisfy  either  (10a)  or  (10b),  and  F 2  be  the  set  of  a  £  O  which  satisfy  either 
(11a)  or  (lib).  Then  we  have  the  following  conclusions.  (Note  that  any 
given  a  specifies  the  game  completely). 


(i)  rx  c  n,  r2  Cfl 
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(2)  Tj  O  ^  4  4  (this  will  be  proven  in  the  Bequel  via  a 

numerical  example  -  Example  2) 

(3)  (£1-1^)  O  4  4  (this  will  also  be  proven  in  the  sequel 

via  a  numerical  example  -  Example  4) 

(4)  Player  i  wants  to  be  the  leader  iff  a  e  T  . 

(5)  Player  i  wants  to  be  the  follower  iff  a  £  Q-R  , 

(6)  Both  players  want  player  i  to  be  the  leader  iff  a  c  r.  n(fl-r\), 

i  \  i- 

(7)  Either  player  wants  himself  to  be  the  leader  iff  a  e  Fj  fl  . 

(8)  Either  player  wantB  himself  to  be  the  follower 

iff  a  «■  (n-rj)  r  (n-rz) . 

Hence,  associated  with  the  nonzero-sum  differential  game  considered 
in  this  note,  we  have  3  different  types  of  solutions,  depending  on  the 
parameters  defining  the  game; 

TYPE  A:  -  CONCURRENT  SOLUTION 

If  a  c  r.n(fi-r\),  i  4  j,  it  follows  from  item  (6)  that  the  players 
mutually  benefit  from  the  leadership  of  the  i'th  player  and  hence  they 
"collectively"  decide  to  play  the  game  under  player  i'th  leadership  (even 
though  it  is  a  noncooperative  game).  We  call  this  a  concurrent  solution, 
since  there  is  no  reason  for  either  player  to  deviate  from  the  corresponding 
Stackelberg  solution  which  was  computed  under  mutual  agreement. 

TYPE  B:  -  NONCONCURRENT  SOLUTION 

If  a  e  Fj  n  either  player  knows  that  he  rill  do  best  (in  the 
noncooperative  sense)  if  he  himself  is  the  leader  (item  7).  Hence  either 
player  will  try  to  announce  his  strategy  first  and  thus  force  the  other 


player  to  pick  the  Stackelberg  strategy  under  his  leadership.  In  this  case, 
the  one  who  can  process  his  data  faster  will  certainly  be  the  3eader  and 
announce  his  policy  first.  However,  if  the  "slower"  player  does  not 
actually  know  that  the  other  player  can  process  his  data  faster  than  he  doeB 
and/or  if  there  is  a  delay  in  the  information  exchange  between  the  two 
players  (which  is  the  case  in  many  economic  situations),  then  he  might 
tend  to  announce  a  Stackelberg  strategy  under  his  own  leadership  quite 
unaware  of  the  announcement  of  the  other  player;  this  certainly  results  in 
a  nonequilibrium  situation. 

TYPE  C:  -  STALEMATE  SOLUTION 

If  a  e(fi-Fj)  n  (ft-^),  then  neither  player  wants  to  be  the  leader 
(item  8).  Both  players  will  rather  prefer  to  wait  for  the  opponent  to 
announce  his  policy  first  --  which  will  result  in  a  "stalemate".  In  order 
to  come  up  with  a  reasonable  solution  for  this  case,  one  has  to  introduce 
some  negotiation  or  bargaining  between  the  players.  The  question  of  the 
existence  and  nature  of  the  bargaining  procedure  that  would  result  in  a 
concurrent  solution  is  yet  an  open  problem  that  requires  further 
investigation. 

We  next  consider  numerical  examples  to  illustrate  these  three 
different  types  of  solutions. 

Example  ,1 

a  =  |  Cj  =  1,  c2  -•  -1,  0^  =  1,  cg  =  0.  5 } 

It  can  readily  be  checked  that  ae^  fl  (fi-F^),  and  hence  this  example 
admits  a  Type  A  solution  with  player  Z  being  the  leader.  This  example 
can  also  be  considered  as  a  velocity- controlled  pursuit-evasion  game  of 


the  nonzero- sum  variety  in  which  the  pursuer  (player  1)  has  less  weight 

on  his  control  than  the  evader  (player  2)  [i.  e.  Under  this 

cp  ce 

condition  the  pursuer  would  prefer  to  wait  and  act  second,  and  the  evader 
would  rather  prefer  to  act  first. 

Different  Stackelberg  payoffs  for  this  example  are 


jj  =  0.400  Xq 
Jj  =  0.  326  Xq 


J*  =  -  0. 040  Xq 

4-  -  °- 143  4 


Example  2 

a  =  jcj  =  1,  c2  =  -  1,  cp=  0.2,  ce  =  0.8} 


For  this  game  q  e  Tj  f>  ^  and  hence  it  admits  a  Type  B  solution,  i.  e. 
neither  player  1  (the  pursuer)  nor  player  2  (the  evader)  want  to  be  the 
follower.  Note  that  the  only  difference  of  this  example  from  the  previous 
one  is  that  now  the  pursuer  has  more  weight  on  his  control  than  the  evader 
Different  Stackelberg  payoffs  for  example  2  are 

jj  =  2.08  Xq  J2  =  ”  °*  069  x0 

4  =  2-0*0  4  =  -°-7s  4 

Example  3 

a  ---  {  Cj  =  1,  c2  =  1,  cp  =  1,  ce  =  0.  5} 


This  example  also  admits  a  Type  B  solution,  i,  e.  a  e  Tj  O  Different 
Stackelberg  payoffs  for  this  game  are 


j}  =  0. 154  Xq 


J*  =  0.  160  xj 
Jj  =  0.  198  Xq  J2  =  0.  lllxjj 

Example  4 

Q  =  jcj  =  -1,  C2  =  "I,  Cp  =  Ce  = 


For  this  final  example,  a  e  (Cl-  T^)  O  (ft-F^)  and  hence  it  admits  a  Type  C 
solution.  Both  players  want  to  be  the  follower  and  this  leads  to  a 
"stalemate"  solution.  One  has  to  introduce  some  cooperation  between 
the  players  in  order  to  derive  a  concurrent  solution  (if  such  a  solution 
exists).  Different  Stackelberg  payoffs  for  this  example  are 


J 

J 


1 

1 

2 

1 


4.  5  x' 

12  x 


'2 

.2 


"  12  xo  ■ 

-  4-  5  xo 


I 

\ 
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